Using Smaller Constituents Rather Than Sentences in Active Learning for Japanese Dependency Parsing

نویسندگان

  • Manabu Sassano
  • Sadao Kurohashi
چکیده

We investigate active learning methods for Japanese dependency parsing. We propose active learning methods of using partial dependency relations in a given sentence for parsing and evaluate their effectiveness empirically. Furthermore, we utilize syntactic constraints of Japanese to obtain more labeled examples from precious labeled ones that annotators give. Experimental results show that our proposed methods improve considerably the learning curve of Japanese dependency parsing. In order to achieve an accuracy of over 88.3%, one of our methods requires only 34.4% of labeled examples as compared to passive learning.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An improved joint model: POS tagging and dependency parsing

Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...

متن کامل

Combining Active Learning and Partial Annotation for Japanese Dependency Parsing

The machine learning-based approaches that dominate natural language processing research require massive amounts of labeled training data. Active learning has the potential to substantially reduce the human effort needed to prepare this data by allowing annotators to focus on only the most informative training examples. This paper shows how active learning can be used for domain adaptation of d...

متن کامل

تأثیر ساخت‌واژه‌ها در تجزیه وابستگی زبان فارسی

Data-driven systems can be adapted to different languages and domains easily. Using this trend in dependency parsing was lead to introduce data-driven approaches. Existence of appreciate corpora that contain sentences and theirs associated dependency trees are the only pre-requirement in data-driven approaches. Despite obtaining high accurate results for dependency parsing task in English langu...

متن کامل

E ectiveness of Prosodic Information in Dependency Analysis of Japanese Sentences

This paper is concerned with measuring the amount of syntactic information contained in prosodic features of read Japanese sentences. Five prosodic features were chosen, and statistical relation between those features and interphrase dependency distance was estimated from a speech database. Then a number of experiments on dependency analysis of Japanese sentences were conducted with the minimum...

متن کامل

Dependency Analysis of Read Japanese Sentences using Pause Information: A Speaker Independent Case

This paper deals with the problem of recovering syntactic structures of sentences by using the prosodic information extracted from spoken versions of the sentences. Prosodic information has proven to be effective to disambiguate syntactic structures, which is not utilized in a conventional rule-based parser. In our previous works, the duration of pauses at phrase boundaries has been found to be...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010